Wav2Vec2 is a speech recognition model developed by Facebook. It learns speech representations from raw audio through self-supervised learning and is fine-tuned on the LibriSpeech dataset to achieve high-accuracy speech transcription.
Speech Recognition
Transformers English